Integrated Tools For Genomic Information Analysis

نویسندگان

  • João Adriano A. Freitas
  • Felipe S. Chambergo
  • Ari J. S. Ferreira
  • Hamza El Dorry
چکیده

Genomic information provides the first stage in understanding the instructions for the synthesis of all of life's molecular machines and the systems needed to control and operate them. The interaction among biological, physical and computing sciences has provided the bases for the creation of a great number of bioinformatic tools [1]. Our laboratory uses the multicellular microorganism Trichoderma reesei as a model system to study the genomic regulatory networks for energy production in eukaryotes. During the developing of an “Expressed Sequence Tags” (ESTs) database for T. reesei [2], we achieved a vast amount of information, making it almost impossible to handle all the data obtained without the use of computational tools. A software methodology for genomic information analysis was developed based on the Integration Definition for Function Modeling [3] (IDEF) approach and the results are fully available at http://trichoderma.iq.usp.br. The IDEF was developed by the United States Air Force and has been used to model a wide variety of automated and non-automated systems. The obtained flux diagrams detail each step of the data analyses, which include: (i) quality analyses of electrophoretograms by the Phred software [4]; (ii) sequence clustering by the Phrap software [5]; (iii) gene identification by the Blast client software, provided by NCBI [6]; (iv) and classification of the genes according to their putative function. Phred software reads DNA sequencer trace data, calls bases, assigns quality values to the bases and writes the base calls and quality values to files in either FASTA or PHD format. Phrap software is a program to assemble shotgun DNA sequence data using the PHD files generated by Phred. The Blast client software compares sequences against the GenBank non-redundant protein database. Based on this methodology, UML [7] structures wereestablished to help specify, visualize and document the softwaresystem, including its structure and design. Thus, researchers withlittle genetic expertise from biological, physical and computingscience can achieve technical understanding of the genomicanalysis software. REFERENCES[1]Setubal, João Carlos, Medanis, João, “Introduction tocomputational molecular biology”, PWS Pub., Boston, 1997 [2]Chambergo, Felipe S. et al., “Elucidation of the metabolic fateof glucose in the filamentous fungus Trichoderma reeseiusing expressed sequence tag (EST) analysis and cDNAmicroarrays”, Journal of Biological Chemistry, 277(16), p.13983-13988, 2002 [3]IDEF Family of Methods, http://www.idef.com/idef0.html[4]Green, Phil, Ewing, Brent, “PHRED Documentation”,http://www.phrap.org/phrap.docs/phred.html [5]Green, Phil, “Documentation for PHRAP andCROSS_MATCH”,http://www.phrap.org/phrap.docs/phrap.html [6]National Center for Biotechnology Information,http://www.ncbi.nlm.nih.gov/BLAST/ [7]Unified Modeling Language, http://www.uml.org

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

برآورد صحت انتخاب ژنومی در جوامع کوچک ژنتیکی- مطالعه‌ شبیه‌سازی

In the present study two genetically connected small and large populations were simulated and the effect of different sources of information from foreign populations on the accuracy of predicted genomic breeding values of young animals of the small population was investigated. A large population consist of 200000 animals over 15 generations and a small population consist of 5000 animals over 3 ...

متن کامل

Measuring user's emotional experience in two tools using an integrated method based on task analysis

Background and Objectives: Human factors are critical in the research and development of various products, but the wideness and unclarity of different aspects of these factors make it difficult to extract the demands of users. Studying emotional aspects of users' behavior with respect to products, as well as interviewing them, is considered an efficient tool for extracting design demands. Me...

متن کامل

GenGIS: A geospatial information system for genomic data.

The increasing availability of genetic sequence data associated with explicit geographic and ecological information is offering new opportunities to study the processes that shape biodiversity. The generation and testing of hypotheses using these data sets requires effective tools for mathematical and visual analysis that can integrate digital maps, ecological data, and large genetic, genomic, ...

متن کامل

Biodefense Oriented Genomic-Based Pathogen Classification Systems: Challenges and Opportunities

Countermeasures that will effectively prevent or diminish the impact of a biological attack will depend on the rapid and accurate generation and analysis of genomic information. Because of their increasing level of sensitivity, rapidly decreasing cost, and their ability to effectively interrogate the genomes of previously unknown organisms, Next Generation Sequencing (NGS) technologies are revo...

متن کامل

Web Portal for Genomic and Epidemiologic Medical Data

Medical data and digital imaging for medical diagnosis currently represent a very important research area in computer science. The generation of medical information is continuously increasing. More specifically, genomic (molecular and histological) data and images have become key points for diagnosis. The specific processing these data require is more and more requested. This article describes ...

متن کامل

PUMA2—grid-based high-throughput analysis of genomes and metabolic pathways

The PUMA2 system (available at http://compbio.mcs.anl.gov/puma2) is an interactive, integrated bioinformatics environment for high-throughput genetic sequence analysis and metabolic reconstructions from sequence data. PUMA2 provides a framework for comparative and evolutionary analysis of genomic data and metabolic networks in the context of taxonomic and phenotypic information. Grid infrastruc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003